Gene expression data mining guided by genomic background knowledge

نویسندگان

  • Pavel Smrz
  • Jana Silhavá
  • Jirí Kléma
  • Filip Zelezný
چکیده

Microarray data represents valuable information resources, nevertheless the knowledge is hidden inside the data and it is not easy to mine. Background knowledge is also stored in various formats and it is challenging to automatically infer the biological meaning from existing repositories. This paper deals with a new gene-expression knowledgefusion system that combines molecular biology data from various sources — the experiment in hand, gene expression data from similar experiments stored in array expression databases, additional knowledge on the most significant genes and their products from specialised services (e.g., pathway databases), and automatically derived results provided by relevant scientific literature. The design of the proposed system is rather complex. We take advantage of recent semantic web technologies to integrate the various modules of the system. Some of the components described in the paper have already taken part in the end-user applications, others still wait for their implementation in the form of software tools.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Data Mining for Identification of Forkhead Box O (FOXO3a) in Different Organisms Using Nucleotide and Tandem Repeat Sequences

 Background: Deregulation of FOXO3a gene which belongs to Forkhead box O (FOXO) transcription factors, can cause cancer (e.g. breast cancer). FOXO factors have important role in ubiquitination, acetylation, de-acetylation, protein-protein interactions and phosphorylation. Understanding the regulation and mechanisms of FOXO3a can lead to cancer treatment. The aim of this study recent association...

متن کامل

Clinic-Genomic Association Mining for Colorectal Cancer Using Publicly Available Datasets

In recent years, a growing number of researchers began to focus on how to establish associations between clinical and genomic data. However, up to now, there is lack of research mining clinic-genomic associations by comprehensively analysing available gene expression data for a single disease. Colorectal cancer is one of the malignant tumours. A number of genetic syndromes have been proven to b...

متن کامل

A hypergraph-based learning algorithm for classifying gene expression and arrayCGH data with prior knowledge

MOTIVATION Incorporating biological prior knowledge into predictive models is a challenging data integration problem in analyzing high-dimensional genomic data. We introduce a hypergraph-based semi-supervised learning algorithm called HyperPrior to classify gene expression and array-based comparative genomic hybridization (arrayCGH) data using biological knowledge as constraints on graph-based ...

متن کامل

ADAM Gene Expression in The Adult CNS and Genetic Aberrations in Cancer Cells

ADAM metalloprotease-disintegrins share a common modular structure of functional domains for proteolytic, cell adhesion, and signaling interactions. The metalloprotease domain of oughly half of the known ADAMs contain an intact consensus metzincin catalytic site, and they are thus thought to function as active metalloproteases. The types of interactions mediated by ADAMs are expressly conspicu...

متن کامل

O-3: Drug Repositioning by Merging Gene Expression Data Analysis and Cheminformatics Target Prediction Approaches

The transcriptional responses of drug treatments combined with a protein target prediction algorithm was utilised to associate compounds to biological genomic space. This enabled us to predict efficacy of compounds in cMap and LINCS against 181 databases of diseases extracted from GEO. 18/30 of top drugs predicted for leukemia (e.g. Leflunomide and Etoposide) and breast cancer (e.g. Tamoxifen a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008